Similarity-based network models and how evaluate them

نویسندگان

  • Robson Motta
  • Alneu de Andrade Lopes
چکیده

Similarity-based network models have been used in many data mining tasks, such as classification and clustering. These models are applied in non-relational data, where each example is represented by a vector of characteristics, creating a relational representation based on similarity among the examples. Using this representation, it is possible to use complex network measures, in a relational data mining context, incorporating more information than the traditional propositional data mining. In this technical report we present a new network model based on similarity, the Extended Minimum Spanning Tree network (EMST), and three measures to evaluate the network models, based on neighborhood, clusters and outliers preservation. The model proposed here is non-parametric and present good results for all evaluation measures when compared with the other models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Similarity measurement for describe user images in social media

Online social networks like Instagram are places for communication. Also, these media produce rich metadata which are useful for further analysis in many fields including health and cognitive science. Many researchers are using these metadata like hashtags, images, etc. to detect patterns of user activities. However, there are several serious ambiguities like how much reliable are these informa...

متن کامل

Link Prediction using Network Embedding based on Global Similarity

Background: The link prediction issue is one of the most widely used problems in complex network analysis. Link prediction requires knowing the background of previous link connections and combining them with available information. The link prediction local approaches with node structure objectives are fast in case of speed but are not accurate enough. On the other hand, the global link predicti...

متن کامل

Uncertainty Modeling of a Group Tourism Recommendation System Based on Pearson Similarity Criteria, Bayesian Network and Self-Organizing Map Clustering Algorithm

Group tourism is one of the most important tasks in tourist recommender systems. These systems, despite of the potential contradictions among the group's tastes, seek to provide joint suggestions to all members of the group, and propose recommendations that would allow the satisfaction of a group of users rather than individual user satisfaction. Another issue that has received less attention i...

متن کامل

Daily Pan Evaporation Estimation Using Artificial Neural Network-based Models

Accurate estimation of evaporation is important for design, planning and operation of water systems. In arid zones where water resources are scarce, the estimation of this loss becomes more interesting in the planning and management of irrigation practices. This paper investigates the ability of artificial neural networks (ANNs) technique to improve the accuracy of daily evaporation estimation....

متن کامل

Looking for Hyponyms in Vector Space

The task of detecting and generating hyponyms is at the core of semantic understanding of language, and has numerous practical applications. We investigate how neural network embeddings perform on this task, compared to dependency-based vector space models, and evaluate a range of similarity measures on hyponym generation. A new asymmetric similarity measure and a combination approach are descr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012